Reinforcement Learning using Kernel-Based Stochastic Factorization

نویسندگان

André da Motta Salles Barreto

Doina Precup

Joelle Pineau

چکیده

Kernel-based reinforcement-learning (KBRL) is a method for learning a decision policy from a set of sample transitions which stands out for its strong theoretical guarantees. However, the size of the approximator grows with the number of transitions, which makes the approach impractical for large problems. In this paper we introduce a novel algorithm to improve the scalability of KBRL. We resort to a special decomposition of a transition matrix, called stochastic factorization, to fix the size of the approximator while at the same time incorporating all the information contained in the data. The resulting algorithm, kernel-based stochastic factorization (KBSF), is much faster but still converges to a unique solution. We derive a theoretical upper bound for the distance between the value functions computed by KBRL and KBSF. The effectiveness of our method is illustrated with computational experiments on four reinforcement-learning problems, including a difficult task in which the goal is to learn a neurostimulation policy to suppress the occurrence of seizures in epileptic rat brains. We empirically demonstrate that the proposed approach is able to compress the information contained in KBRL’s model. Also, on the tasks studied, KBSF outperforms two of the most prominent reinforcement-learning algorithms, namely least-squares policy iteration and fitted Q-iteration.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On-line Reinforcement Learning Using Incremental Kernel-Based Stochastic Factorization

Kernel-based stochastic factorization (KBSF) is an algorithm for solving reinforcement learning tasks with continuous state spaces which builds a Markov decision process (MDP) based on a set of sample transitions. What sets KBSF apart from other kernel-based approaches is the fact that the size of its MDP is independent of the number of transitions, which makes it possible to control the trade-...

متن کامل

On-line Reinforcement Learning Using Incremental Kernel-Based Stochastic Factorization SUPPLEMENTARY MATERIAL

This is the supplementary material for the paper entitled “On-line Reinforcement Learning Using Incremental Kernel-Based Stochastic Factorization” [2]. It contains the details of our theoretical developments that could not be included in the paper due to space constraints. This material should be read in conjunction with the main paper. 1 Preliminaries • Similarly to Ormoneit and Sen [3], we de...

متن کامل

Practical Kernel-Based Reinforcement Learning

Kernel-based reinforcement learning (KBRL) stands out among approximate reinforcement learning algorithms for its strong theoretical guarantees. By casting the learning problem as a local kernel approximation, KBRL provides a way of computing a decision policy which is statistically consistent and converges to a unique solution. Unfortunately, the model constructed by KBRL grows with the number...

متن کامل

An Expectation-Maximization Algorithm to Compute a Stochastic Factorization From Data

When a transition probability matrix is represented as the product of two stochastic matrices, swapping the factors of the multiplication yields another transition matrix that retains some fundamental characteristics of the original. Since the new matrix can be much smaller than its precursor, replacing the former for the latter can lead to significant savings in terms of computational effort. ...

متن کامل

Online Kernel Matrix Factorization

The problem of efficiently applying a kernel-induced feature space factorization to a largescale data sets is addressed in this thesis. Kernel matrix factorization methods have showed good performances solving machine learning and data analysis problems. However, the present growth of the amount of information available implies the problems can not be solved with conventional methods, due their...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2011

Reinforcement Learning using Kernel-Based Stochastic Factorization

نویسندگان

چکیده

منابع مشابه

On-line Reinforcement Learning Using Incremental Kernel-Based Stochastic Factorization

On-line Reinforcement Learning Using Incremental Kernel-Based Stochastic Factorization SUPPLEMENTARY MATERIAL

Practical Kernel-Based Reinforcement Learning

An Expectation-Maximization Algorithm to Compute a Stochastic Factorization From Data

Online Kernel Matrix Factorization

عنوان ژورنال:

اشتراک گذاری